In these days of increasing costs, the Navy must ensure
it is recruiting and retaining the best possible personnel.
Enlistees should not only have the capacity to do the work,
but also the desire and drive to accomplish the assigned
tasks. Currently, there are few measures of effectiveness
which indicate how an enlistee will perform in the Navy.
The SCREEN score, developed by Dr. Robert Lockman of the
Center for Naval Analyses, predicts attrition of first term
recruits as a function of test score and biographic data
[1]. It does not, however, specifically address the
performance of a member while in the Navy; rather it
predicts survival in the Navy.


The advancement to E-4 is a possible measure of
effectiveness with which a member's utility to the Navy
could be judged. First, the member must display a certain
amount of initiative and drive to prepare for the exam.
Second, passing the test is an indication that a member has
reached a basic level of knowledge within a specific rating.
It is at this point that the Navy begins to get a return on
its investment in the member. The E-4 test, therefore, may
be used as a measure of both a member's capacity and desire
to perform in the Navy.


This thesis explores the differences between the E-4
test passers and non-test passers based on the individual's
biographic and demographic data. One would like to know
what variables in a member's background are associated with
his ability and desire to pass the exam. Specifically, an
ordered list of determining factors is desired. Also, it is
important to learn if the determining factors are identical
for all ratings or if they vary from rating to rating.
Finally, it would be interesting to discover how length of
time in service and time in rate are related to a member's
tendency to advance.


For the purposes of this analysis, a test passer is
defined as a member who has taken the E-4 exam and received
a score which caused him to be advanced to E-4. A non-test
passer is a member who either took the E-4 exam and did
not receive a high enough score to be advanced or who did
not take the exam even though he was eligible. Most of the
non-test passers in the study are non-test takers. The
separation of the non-test takers from the non-test passers
was not possible due to the lack of data. The reader will
need to keep these definitions in mind while reading the
text. For clarity, a graphical presentation of these
definitions may be found in Figure 1 in Chapter III.


The data used in this study come from all personnel
eligible to take the E-4 exam in August 1977. A member was
considered eligible if he had at least one year in the Navy
and at least six months as an E-3.


Two major results were obtained from this study. First,
the use of non-linear discrimination resulted in correct
classification of 60 - 70% of the members of each sample
examined. Second, hierarchical clustering was applied to
group the ratings (especially those with small numbers of
members). These groupings may have a general use beyond those
of this paper.





There are many possible factors which may affect a
member's desire and ability to pass the E-4 exam. Examples
of these are education, job satisfaction,
leadership of superiors, sea/shore assignment, etc. Data
for some of the more appealing of these were unavailable for
the study. The factors used in this analysis were all drawn
from the August 1977 monthly Enlisted Master Record (EMR)
file located at the Naval Military Personnel Center in
Arlington, Virginia. The month of August was chosen because
it coincided with a month in which the E-4 exams were given.


The EMR is a 3000 character record which is maintained
on each active duty and reserve enlisted member of the Navy.
Such items as social security number, pay entry base date,
schools attended, and test scores are stored on each record.
Each record is a condensed version of a member's personnel
jacket.


To make the data easier to manipulate, the 3000
character record file was reduced to a 150 character record
file. Each record contained the following demographic data:

Age
Sex
Race

mien yolebves Sopa e) Fhe) 
Home of record
Dependents
Time in rate
Active duty/Reserve duty indicator
Length of service
End of Active Duty Obligation date
Enlistment term
Years of education
Education certification
A-School/No A-School indicator
Special test scores
SCREEN score
Mental aptitude test scores (ASVAB or BTB)


The special test scores measure aptitude in the areas of
sonar, electronics, and radio. The SCREEN score is a
measure of a recruit's chances of completing the
years of his enlistment. The mental aptitude tests are a
measure of the member's overall intelligence. The Basic
Test Battery (BTB) consists of five exams in the areas of
general intelligence, numerical reasoning, and mechanical,
clerical, and shop aptitude. The Armed Services Vocational
Aptitude Battery (ASVAB) contains sixteen tests measuring
specific areas of intelligence and aptitudes, including the
ones in the BTB. Unfortunately, the ASVAB had not been
given to a large enough sample of the population to allow it
to be used in the analysis.


From the data set, the individuals were extracted whose
length of service was at least one year, and whose time in
rate was at least six months. These were considered the
members eligible to take the E-4 exam. Since completion of
correspondence courses and practical factors is not
reflected in the EMR, it was not possible to check these
eligibility requirements. However, it was assumed that if a
member fulfilled the time requirements and desired to take
the exam, he would have his other requirements completed.
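
The time-based eligibility screen described above can be
illustrated with a short sketch. The record layout (the
Member class and its field names) and the sample values are
hypothetical and are not EMR field names; only the two
thresholds (one year of service, six months in rate) come
from the text.

```python
from dataclasses import dataclass


@dataclass
class Member:
    member_id: str        # hypothetical identifier, not an EMR field
    service_months: int   # length of service, in months
    rate_months: int      # time in rate as an E-3, in months


def is_eligible(m: Member) -> bool:
    """Apply only the time requirements: >= 12 months of service
    and >= 6 months in rate. Correspondence courses and practical
    factors cannot be checked, as noted in the text."""
    return m.service_months >= 12 and m.rate_months >= 6


# Made-up records for illustration only.
members = [
    Member("A", 14, 7),
    Member("B", 11, 8),   # too little service
    Member("C", 30, 5),   # too little time in rate
    Member("D", 12, 6),   # exactly at both thresholds
]

eligible_ids = [m.member_id for m in members if is_eligible(m)]
print(eligible_ids)
```

Both thresholds are treated as inclusive, matching the
wording "at least" used in the text.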


The variables of the data were originally in three
scales: binary, nominal, and interval. Nominal data are
data in which numbers are used as labels or "names", e.g.,
Black=1, Chicano=2, White=3. Binary data are a special
form of the nominal where only two responses are available.
Interval data include all continuous variables where
differences between scores are meaningful. For example,
psychological test data are treated as interval data. All
of the nominal data were converted to binary values in order
to meet the assumptions of the analysis. For instance, the
educational certification was transformed from the
responses: no degree, GED, high school diploma, Bachelor's
degree, and postgraduate degree, to high school diploma or
no high school diploma. The GED and college degree holders
were grouped with the high school diploma holders.
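
The conversion of the education certification from a nominal
to a binary variable can be sketched as follows. The
response strings are taken from the list above; the function
name and the choice to raise an error on an unrecognized
response are assumptions made for the illustration.

```python
# Responses that the text groups with the high school diploma holders.
DIPLOMA_OR_ABOVE = {
    "GED",
    "high school diploma",
    "Bachelor's degree",
    "postgraduate degree",
}


def diploma_indicator(certification: str) -> int:
    """Map the nominal certification to 0 (no high school diploma)
    or 1 (high school diploma, including GED and college degrees)."""
    if certification == "no degree":
        return 0
    if certification in DIPLOMA_OR_ABOVE:
        return 1
    # Assumption: anything else is treated as a data error.
    raise ValueError(f"unrecognized certification: {certification!r}")


responses = ["no degree", "GED", "Bachelor's degree", "high school diploma"]
indicators = [diploma_indicator(r) for r in responses]
print(indicators)
```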


Some variables were not used in parts or all of the
analysis because it was found there were not enough members
in the sample with values for these variables. For example,
sex could not be used in analyzing the Boiler Technician
rating because there were no female members in the sample
population. Likewise, the ASVAB scores could not be used
because not enough of the sample population had taken this
test. Some of the other determining factors were not used
because it was decided at the outset that the variable could
not be used in policy decisions. Home of record is an
example of this type of variable. All variables used in the
analysis with a description of their types and ranges are
listed in Appendix A.

Due to the non-availability of data listing all test
takers and test passers for the August 1977 cycle, a
comparison of the August 1977 EMR and the August 1978 EMR
was necessary to determine who passed the exam. The E-4
exam is given only twice a year and all test passers from a
given cycle who are advanced to E-4 are advanced from three
to nine months after the cycle. Therefore, if a member was
eligible to take the exam according to the August 1977 file
and was promoted between December 1977 and May 1978, he was
classified as a test passer. If he was eligible to take the
exam in August 1977 but he was not promoted during the
advancement period, he was classified as a non-test passer
regardless of whether or not he took the exam. This is
illustrated in Figure 1. Finally, if he was eligible to
take the exam in August 1977 and was not in the August 1978
file, he was listed as out of service because it was not
possible to determine if he passed the exam or not.
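
The grouping rule just described can be written as a small
function. This is a sketch under stated assumptions: the
three boolean inputs are hypothetical flags that would have
to be derived by comparing the two EMR files, and the
function name is illustrative.

```python
from typing import Optional


def classify_member(
    eligible_aug_1977: bool,
    in_aug_1978_file: bool,
    advanced_in_window: bool,
) -> Optional[str]:
    """Return the study group for one member, or None if the member
    was not eligible for the August 1977 exam.

    advanced_in_window: promoted to E-4 between December 1977 and
    May 1978, the advancement period stated in the text.
    """
    if not eligible_aug_1977:
        return None
    # Members missing from the later file cannot be checked for
    # advancement, so they form their own group.
    if not in_aug_1978_file:
        return "out of service"
    if advanced_in_window:
        return "test passer"
    # Includes members who never took the exam.
    return "non-test passer"


groups = [
    classify_member(True, True, True),
    classify_member(True, True, False),
    classify_member(True, False, False),
    classify_member(False, True, False),
]
print(groups)
```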
Graphical Representation of Test Passers and Non-test Passers
(diagram labels: Passers; Non-test Takers; shaded area
includes all non-test passers as defined above)


Figure 1.


Once the groups were determined, it was possible to
evaluate the data by either of two methods. First, since
the out of service member was in the Navy at the time the
test was given, he was actually a test passer or a non-test
passer. The assumption would be made that the reason he
left the service was that he completed his time in service
requirements. Therefore, time alone would distinguish the
out of service member from the test passer and the non-test
passer. If all factors involving time in the Navy such as
length of service and time in rate are ignored, then the
test passer and non-test passer groups may be looked on as a
random subset of the original test passer / non-test passer
/ out of service data. This gives just two all inclusive
groups as they were found on the day of the test.
Unfortunately, one is forced to ignore military time related
factors [variables 13, 14, and 15, Appendix A] which might
prove to be useful. Even so, this approach might well be
considered useful in evaluating enlistees who as yet have no
time in service.


The second method allows the inclusion of military time
dependent factors. In this approach the data are viewed at
the test cycle plus one year position (August 1978). From
this time perspective, a member may fall into only one of
the three groups: test passer, non-test passer, or out of
service. Since all members are included in this approach
and no assumptions of random subsets are made, the time
dependent factors may be used. Although classification of
out of service personnel may at first seem extraneous to the
problem, it is useful knowledge, and may in fact be as
important as identifying the test passers.


Both of the above methods of grouping may be applied at
various levels of the Navy's structure. First they may be
used at the all Navy level. Here all E-3's who meet the
time requirements for E-4 would be considered, regardless of
rate. Next, the individual ratings may be investigated.
Before a member may take an E-4 exam in a particular rating,
he must complete certain study and practical factors
requirements for that rating. Members in the process of
doing this are referred to as designated strikers in that
rating and are identified on the EMR file. By analyzing
only the designated strikers in a specific rate, it may be
possible to predict the test passers for that rate.
Finally, for the sample available, certain ratings had too
few designated strikers to give any conclusive results. To
handle these cases, hierarchical clustering methods could be
used to group ratings according to similarity of specified
attributes of the ratings. Each group of ratings could then
be subjected to analysis.











IV. APPROACHES TO ANALYSIS 




The data used in this analysis consisted of fifteen
variables per member. The intent was to divide the members
into groups according to these variables. Hence, the use of
some form of multivariate analysis was dictated.


The intent of the analysis was two-fold. First, it was
necessary to find out if a difference existed between the
groups in terms of the available data. Second, if a
difference did exist, was it possible to obtain a
classification scheme which would correctly classify an
acceptably high percentage of the members?


Discriminant analysis is one method of performing the
desired analyses. Discriminant analysis is a multivariate
statistical technique used for constructing decision rules
by which data units (enlisted members) may be assigned to
groups to which they have the greatest resemblance. These
decision rules are statistical functions. The independent
variables in the functions are the attributes of the member.
This analysis is valid for both the two group and the three
group case.


There are three basic assumptions underlying
discriminant analysis [2]. First, the groups must be
discrete and identifiable. Both grouping methods defined in
the last section meet this requirement. Second, each
observation in each group can be described by a set of m
variables. This condition is met by use of the fifteen
determining factors. Finally, all m variables are assumed
to have a multivariate normal distribution. The
interval data for this study were unimodal and symmetric.
The data were judged to have a close enough approximation to
a normal distribution to meet this assumption. The binary
data, of course, did not. However, according to Gilbert [3],
binary data may be used with little loss of discriminating
power. This is because the rotations of the axes which
occur in discriminant analysis cause the binary data to
appear to be binary no longer.


As an example, consider a two space problem of binary
vs. interval data:


Binary vs. Interval Data


Figure 2.


If the axis is rotated to obtain a discriminant
function, the data no longer appear to be binary vs.
interval, but interval vs. interval as shown in the
following figure.


Transformed Binary vs. Interval Data


Figure 3.


Given that all the basic assumptions for the use of
discriminant analysis are satisfied, the type of
discriminant function must be chosen. If the
variance-covariance matrices of the groups are equal, a
linear function may be used. If they are not equal, then
better discrimination is achieved by a quadratic function.
A two-dimensional illustration of this is shown in the
following diagram. The inequality of the covariance
matrices is reflected by the elliptical contours of constant
density.


Quadratic vs. Linear Discriminating Functions


Figure 4.


The quadratic function above misclassifies only one data
point, whereas the linear function misclassifies five
points. When given a choice, even if the
variance-covariance matrices are equal, it is wise to choose
the quadratic function. The quadratic should always perform
at least as well as the linear function.


As was stated in Chapter III, some ratings do not have a
large enough sample in any testing cycle to allow one to
conduct a meaningful analysis. To resolve this problem,
cluster analysis was used to group the ratings.


There are several methods of cluster analysis available.
The particular type used in this study is termed
hierarchical clustering. For this method, the data
attributes of each entity (rating) are investigated to
determine which two entities are the most alike [4]. These
two entities are clustered together to form a new entity and
an averaged set of variables is computed for the newly
formed cluster. This cycle is repeated until there is one
final cluster containing all of the original entities. The
results can be represented as a tree with all the original
units on the left and the single cluster on the right.


Hierarchical Clustering Tree


Figure 5.
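

The merge cycle described above can be sketched in a few
lines. This is a minimal illustration, assuming squared
Euclidean distance between the averaged variable sets as the
measure of likeness; the rating names and scores are
hypothetical, and the code is not the routine used in the
study.

```python
from itertools import combinations


def sq_dist(a, b):
    """Squared Euclidean distance between two variable sets."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cluster_tree(entities):
    """Repeatedly join the two most alike entities until one remains.

    entities: dict mapping a name to a tuple of variables.
    Returns the list of (name_a, name_b) merges in the order made.
    """
    # Each cluster keeps its averaged variables and its member count.
    clusters = {name: (tuple(v), 1) for name, v in entities.items()}
    merges = []
    while len(clusters) > 1:
        a, b = min(
            combinations(sorted(clusters), 2),
            key=lambda pair: sq_dist(clusters[pair[0]][0], clusters[pair[1]][0]),
        )
        (mean_a, n_a), (mean_b, n_b) = clusters.pop(a), clusters.pop(b)
        # Averaged set of variables for the newly formed cluster.
        merged = tuple(
            (n_a * x + n_b * y) / (n_a + n_b) for x, y in zip(mean_a, mean_b)
        )
        clusters[f"({a}+{b})"] = (merged, n_a + n_b)
        merges.append((a, b))
    return merges


# Hypothetical ratings described by two averaged test scores.
ratings = {
    "R1": (50.0, 52.0),
    "R2": (51.0, 52.0),
    "R3": (60.0, 61.0),
    "R4": (61.0, 63.0),
}
merge_order = cluster_tree(ratings)
print(merge_order)
```

Reading the merges in order reproduces the tree from left
(original units) to right (single cluster).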


The major advantage to the hierarchical clustering
methods is that they allow one to view the overall
relationships among the data units. The tree diagram shows
the actual clusters which occur and the degree of
similarity at the end of each clustering cycle. From this,
the analyst is free to choose the optimal number of clusters
for his purposes.

Discriminant analysis was applied to the data to
determine if a difference existed between the group means
and, if so, to construct decision rules to place members in
the proper groups.


The first step in the analysis was to determine if the
variance-covariance matrices of the groups were equal. The
results of this dictated whether to use a linear or
quadratic discriminating function. In this analysis, every
sample produced unequal variance-covariance matrices.
Therefore, the quadratic function was used in every case.


Next, the data from each group were investigated to
determine if a difference existed between the means of the
groups in terms of the variables being investigated. The
derivation of the two group case is presented in the
following paragraphs. The three group case is found in
Appendix B.


For this analysis, the groups were chosen to be of equal
size ($N_1 = N_2 = N$) and the variance-covariance matrices are
assumed to be unequal. $N$ observations from each of the test
passer and non-test passer groups may be represented in
vector notation as follows, where the $n$th observation from
each group is a $1 \times m$ vector

$$x_n = (x_{1,n}, x_{2,n}, \ldots, x_{m,n})$$

where $n = 1, \ldots, N$ and $m$ is the number of variables in each
observation. $\mu_1$ and $\mu_2$ are also vectors of length $m$.

$$\mu_1 = (\mu_{1,1}, \ldots, \mu_{m,1}), \qquad \mu_2 = (\mu_{1,2}, \ldots, \mu_{m,2})$$

The null hypothesis for this section of the analysis is that
the means of the groups in terms of the determining
factors are equal or

$$H_0: \mu_1 = \mu_2$$

The standard method in univariate statistics for
determining if the mean of a population with unknown
variance is equal to a specific value is the t-test. Given
that the population has a $N(\mu, \sigma^2)$ distribution, the t
statistic is defined as follows

$$t = \sqrt{N}\,(\bar{x} - \mu)/S$$

The extension of the t statistic to multivariate analysis is
the $T^2$ statistic which is defined as

$$T^2 = N(\bar{x} - \mu)' S^{-1} (\bar{x} - \mu)$$

Now let $y_n = x_n^{(1)} - x_n^{(2)}$, $n = 1, \ldots, N$, where the
superscript denotes the group. $\bar{y}$ is defined as

$$\bar{y} = \frac{1}{N}\sum_{n=1}^{N} \left(x_n^{(1)} - x_n^{(2)}\right) = \bar{x}^{(1)} - \bar{x}^{(2)}$$

and the variance-covariance matrix is

$$S = \frac{1}{N-1}\sum_{n=1}^{N} (y_n - \bar{y})(y_n - \bar{y})'
    = \frac{1}{N-1}\sum_{n=1}^{N} \left(x_n^{(1)} - x_n^{(2)} - \bar{x}^{(1)} + \bar{x}^{(2)}\right)\left(x_n^{(1)} - x_n^{(2)} - \bar{x}^{(1)} + \bar{x}^{(2)}\right)'$$

The $T^2$ statistic in terms of $y$ is

$$T^2 = N\,\bar{y}' S^{-1} \bar{y}$$

and the null hypothesis as stated above can be rewritten as

$$H_0: \mu_1 - \mu_2 = 0$$

to accommodate $T^2$ in terms of $y$.

The object of the $T^2$ test is to find a confidence region
about $\mu_1$ and $\mu_2$, as shown below. The size of the
confidence region is dependent on $\alpha$, the level of
significance of the test.


Confidence Region About $\mu_1$ and $\mu_2$


Figure 6.


With a scaling change, the $T^2$ statistic is distributed
according to an F distribution with $m$ and $N-m-1$ degrees of
freedom [5]. Therefore, if the scaled $T^2$ statistic exceeds
the critical value $F_{m,\,N-m-1}(\alpha)$,
the hypothesis that the means are equal is rejected.
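

The T-squared computation for the two group case can be
illustrated numerically. The following is a minimal sketch
for m = 2 variables, assuming the observations of the two
equal-sized groups are paired in the order given; the
pairing, the function names, and the data are hypothetical.
The scaling to an F statistic is left out.

```python
def mean_vector(rows):
    """Column means of a list of equal-length rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def hotelling_t2_paired(group1, group2):
    """T^2 = N * ybar' S^-1 ybar for two variables, where y is the
    difference between paired observations of the two groups."""
    diffs = [[a - b for a, b in zip(r1, r2)] for r1, r2 in zip(group1, group2)]
    n = len(diffs)
    y1, y2 = mean_vector(diffs)
    # Variance-covariance matrix S of the differences (divisor N - 1).
    s11 = sum((d[0] - y1) ** 2 for d in diffs) / (n - 1)
    s22 = sum((d[1] - y2) ** 2 for d in diffs) / (n - 1)
    s12 = sum((d[0] - y1) * (d[1] - y2) for d in diffs) / (n - 1)
    det = s11 * s22 - s12 ** 2
    # ybar' S^-1 ybar, using the closed-form inverse of a 2x2 matrix.
    quad = (s22 * y1 * y1 - 2 * s12 * y1 * y2 + s11 * y2 * y2) / det
    return n * quad


# Made-up data: four members per group, two variables each.
passers = [(3, 5), (4, 4), (5, 6), (6, 7)]
non_passers = [(1, 4), (3, 2), (2, 5), (4, 4)]
t2 = hotelling_t2_paired(passers, non_passers)
print(round(t2, 4))
```

A large value of T-squared relative to the tabled critical
value indicates that the group means differ.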


Once the inequality of the means has been established,
the decision rules can be constructed. Again, the two group
case will be dealt with here and the three group case can be
found in Appendix C.








Anderson states that in addition to the basic
discriminant function, the classification function must take
into account the a priori probability of group membership
and/or the costs of misclassification. Furthermore, he
states that, "the good classification scheme minimizes the
bad effects of misclassification" [5]. As a means of
exploring this, let $M$ be the misclassification function
which is to be minimized.

$$M = \pi_1 P(2|1) + \pi_2 P(1|2)$$

$\pi_h$ is the a priori probability of an observation being
drawn from group $h$, and $P(g|h)$ is the probability of
classifying an observation as a member of group $g$ when it is
actually a member of group $h$. The costs of
misclassification could be included but are unknown and are
not dealt with in this study. The quantity $M$ is minimized
by the following rule: assign to group one if

$$\frac{f_1(x)}{f_2(x)} \geq \frac{\pi_2}{\pi_1}$$

Otherwise, assign to group two. $f_i(x)$ is the probability
density function of population $i$. Assuming multivariate
normal populations with unequal variance-covariance
matrices, the above equation becomes

$$\frac{\left[(2\pi)^{m/2}|\Sigma_1|^{1/2}\right]^{-1}\exp\left[-\tfrac{1}{2}(x-\mu_1)'\Sigma_1^{-1}(x-\mu_1)\right]}
       {\left[(2\pi)^{m/2}|\Sigma_2|^{1/2}\right]^{-1}\exp\left[-\tfrac{1}{2}(x-\mu_2)'\Sigma_2^{-1}(x-\mu_2)\right]}
  \geq \frac{\pi_2}{\pi_1}$$

where $\Sigma$ is the variance-covariance matrix. Taking natural
logs of both sides and simplifying yields

$$\tfrac{1}{2}\ln\frac{|\Sigma_2|}{|\Sigma_1|}
  - \tfrac{1}{2}(x-\mu_1)'\Sigma_1^{-1}(x-\mu_1)
  + \tfrac{1}{2}(x-\mu_2)'\Sigma_2^{-1}(x-\mu_2)
  \geq \ln\frac{\pi_2}{\pi_1}$$

By rearranging the equation, the quadratic decision rule is
formed.

$$\tfrac{1}{2}x'\left(\Sigma_2^{-1} - \Sigma_1^{-1}\right)x
  + x'\left(\Sigma_1^{-1}\mu_1 - \Sigma_2^{-1}\mu_2\right)
  \geq \ln\frac{\pi_2}{\pi_1}
  + \tfrac{1}{2}\ln\frac{|\Sigma_1|}{|\Sigma_2|}
  + \tfrac{1}{2}\left(\mu_1'\Sigma_1^{-1}\mu_1 - \mu_2'\Sigma_2^{-1}\mu_2\right)$$

If the above equation is true, the observation should be
assigned to group 1. If not, assign the observation to
group 2.
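

The quadratic decision rule can be sketched in code for the
two variable case. The sketch compares the log density plus
the log a priori probability of each group, which is
equivalent to the density ratio test for multivariate normal
populations. The means, variance-covariance matrices,
priors, and function names below are made up for the
illustration.

```python
import math


def log_density_2d(x, mean, cov):
    """Log of the bivariate normal density at x."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # (x - mu)' Sigma^-1 (x - mu) via the closed-form 2x2 inverse.
    quad = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return -math.log(2 * math.pi) - 0.5 * math.log(det) - 0.5 * quad


def assign_group(x, mean1, cov1, prior1, mean2, cov2, prior2):
    """Assign x to group 1 if f1(x)/f2(x) >= prior2/prior1, else group 2."""
    score1 = math.log(prior1) + log_density_2d(x, mean1, cov1)
    score2 = math.log(prior2) + log_density_2d(x, mean2, cov2)
    return 1 if score1 >= score2 else 2


# Hypothetical groups with unequal variance-covariance matrices.
mean1, cov1 = (0.0, 0.0), ((1.0, 0.0), (0.0, 1.0))
mean2, cov2 = (3.0, 3.0), ((4.0, 0.0), (0.0, 4.0))

points = [(0.5, 0.5), (3.0, 3.0), (-4.0, -4.0)]
assigned = [assign_group(p, mean1, cov1, 0.5, mean2, cov2, 0.5) for p in points]
print(assigned)
```

The third point lies on the far side of group 1 from group
2, yet it is assigned to group 2 because that group has the
larger spread. A single straight-line boundary between the
two means could not produce this result, which is why the
quadratic rule is preferred when the matrices are unequal.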


It is also possible to determine the contribution that
each variable makes to the discriminant function. That is,
it is possible to compute the relative amount of
discriminatory power that each variable has.

Let $V_i$ be an $m \times 1$ vector of coefficients of the $i$th
discriminating function. This vector is transformed into a
scaled vector $V_i^*$ by multiplying each element of $V_i$ by the
square root of the corresponding diagonal element of the
pooled within groups deviation sum of squares matrix $W$,
where

$$W = \sum_{g}\sum_{n} (x_{gn} - \bar{x}_g)(x_{gn} - \bar{x}_g)'$$

The scaled vector $V_i^*$ is

$$V_{ij}^* = V_{ij}\sqrt{w_{jj}}, \qquad j = 1, \ldots, m$$


The discrimination power of each variable can now be 


expressed as 


The method of hierarchical clustering used in this study
is due to Ward [6]. This method is based on the computation
of the Euclidean distance between centroids of the entities.

For this discussion, let $x_{ijk}$ be the value of the $i$th
variable of the $j$th data unit in the $k$th cluster. The mean
of the $i$th variable in the $k$th cluster is

$$\bar{x}_{ik} = \frac{1}{m_k}\sum_{j=1}^{m_k} x_{ijk}$$

where $m_k$ is the number of data units in cluster $k$,
and the error sum of squares for cluster $k$ is

$$E_k = \sum_{i}\sum_{j=1}^{m_k} (x_{ijk} - \bar{x}_{ik})^2$$

The criterion for the Ward method is the minimization of
the increase in the within group error sum of squares.

$$\Delta E_{pq} = E_t - E_p - E_q$$

where $E_p$ and $E_q$ are the within group error sum of squares
for the two entities which are joined to form the cluster $t$.
By simplification

$$\Delta E_{pq} = \frac{m_p m_q}{m_p + m_q}\sum_{i} (\bar{x}_{ip} - \bar{x}_{iq})^2$$

This function should force the entities to group into tight
clusters. Also, this should cause the distance between the
clusters to be at a maximum.
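

Ward's criterion can be checked numerically. The sketch
below, using made-up two-variable data, computes the
increase in the within group error sum of squares directly
and again from the cluster centroids, using the standard
identity that the increase equals the product of the two
cluster sizes over their sum, times the squared Euclidean
distance between the centroids. The function names are
illustrative.

```python
def centroid(points):
    """Mean of each variable over the data units in a cluster."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def error_sum_of_squares(points):
    """Sum of squared deviations of every value from its cluster mean."""
    c = centroid(points)
    return sum((x - m) ** 2 for p in points for x, m in zip(p, c))


def ward_increase(p, q):
    """Increase in error sum of squares when clusters p and q are joined."""
    joined = p + q
    return (
        error_sum_of_squares(joined)
        - error_sum_of_squares(p)
        - error_sum_of_squares(q)
    )


def ward_increase_from_centroids(p, q):
    """Same quantity computed from the cluster sizes and centroids only."""
    cp, cq = centroid(p), centroid(q)
    weight = len(p) * len(q) / (len(p) + len(q))
    return weight * sum((a - b) ** 2 for a, b in zip(cp, cq))


# Hypothetical clusters of data units with two variables each.
cluster_p = [(1.0, 2.0), (3.0, 2.0)]
cluster_q = [(6.0, 6.0)]

direct = ward_increase(cluster_p, cluster_q)
shortcut = ward_increase_from_centroids(cluster_p, cluster_q)
print(round(direct, 4), round(shortcut, 4))
```

At each cycle the pair of entities with the smallest such
increase would be the pair joined.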
VI. RESULTS OF ANALYSIS


The discriminant analysis discussed in this section was
done with the aid of computer routines appearing in
Eisenbeis and Avery [2]. These routines use a complete
stepwise procedure to guarantee the best discriminant
function possible from the given data. This procedure
investigates every possible combination of determining
factors beginning with a subset of one and continuing up to
m factors [2]. For each subset the best combination of
variables is chosen. When the analysis is complete, the
analyst is free to choose the subset with the best function
for his purposes. In this study, all m variables were used
in every case.


The analysis was conducted on six samples. The first
was at the all Navy level where eligible members were
randomly selected without regard to whether they were
designated strikers. Next, the ratings in each of the
Navy's general assignment categories (Seamen, Firemen,
Airmen, and Constructionmen) were subjected to cluster
analysis to find which rates tended to be most alike. These
clusters were formed using computer routines designed by
Anderberg [4]. The variables used to determine the
clustering were the average individual ASVAB test scores
from the August 1978 cohort. These variables were chosen
because they normally remain reasonably constant over a
member's career. The averages were computed by summing the
test scores (separately for each subtest) for every E-4
through E-6 in each rate and dividing by the number in the
rate. Three of the clusters obtained from the Seamen
category were subjected to discriminant analysis. The last
two sample populations to be analyzed were individual
ratings: the Boiler Technicians and the Machinist's Mates.
The following diagram depicts the analyses which were done.
The reader may find it helpful to refer to this while
reading the succeeding material.


Discriminant Analyses Conducted in this Study
(diagram: a two group and a three group analysis for each of
the All Navy sample, Seamen clusters 1, 2, and 4, the Boiler
Technicians, and the Machinist's Mates)


Figure 7.
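

The per-rating averages used as clustering variables, as
described earlier in this chapter, can be sketched as
follows. The record layout, rating abbreviations, and scores
are hypothetical; only the averaging rule (each subtest
summed over every E-4 through E-6 in the rate and divided by
the number in the rate) comes from the text.

```python
from collections import defaultdict


def rating_profiles(records):
    """Average each subtest score over the E-4 through E-6 members
    of each rating.

    records: iterable of (rating, paygrade, scores) tuples.
    Returns a dict mapping rating to a list of mean subtest scores.
    """
    sums = {}
    counts = defaultdict(int)
    for rating, paygrade, scores in records:
        if paygrade not in ("E-4", "E-5", "E-6"):
            continue  # other paygrades are not part of the average
        if rating not in sums:
            sums[rating] = [0.0] * len(scores)
        for i, score in enumerate(scores):
            sums[rating][i] += score
        counts[rating] += 1
    return {r: [t / counts[r] for t in totals] for r, totals in sums.items()}


# Made-up records: (rating, paygrade, two subtest scores).
records = [
    ("BT", "E-4", (50, 60)),
    ("BT", "E-5", (54, 62)),
    ("BT", "E-3", (90, 90)),  # ignored: not E-4 through E-6
    ("MM", "E-6", (58, 66)),
]
profiles = rating_profiles(records)
print(profiles)
```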


For the all Navy sample, 300 members of each group,
i.e., test passers, non-test passers, and out of service
personnel, were randomly selected from the eligible pool.
First, the two group case, test passers vs. non-test passers,
was analyzed. After checking the variance-covariance
matrices and finding them to be unequal, a comparison of the
group means was made. The test for equality of group means
produced an F statistic of 3.83 which rejected the null
hypothesis at the 0.01 level of significance (see Appendix
D). The variables in this sample which provided the
greatest power of discrimination were race (24.5%), high
school diploma (20.3%), and dependents (13.59%). The
analysis indicated Caucasian members had a higher
probability of passing the exam than their non-Caucasian
counterparts. Individuals having a high school diploma
and/or having dependents also were more likely to pass the
exam. A complete list of the determining factors and the
amount of discriminating power for each sample population is
given in Appendix E.


Once the discriminant function was computed, the
classification of each member was done using a quadratic
function as described in Chapter V. The rate of "hits" or
correct classification for the all Navy two group sample was
62% (see Appendix H). This classification rate includes the
effects of the a priori probability of group membership as
do all other classification rates in this study.
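

The hit rate is simply the share of members whose predicted
group matches their actual group. A minimal sketch follows,
with hypothetical group labels ("P" for test passer, "N" for
non-test passer) and made-up classifications; it also
reports the rate within each group, since a reasonable
overall rate can conceal a poor rate for one group.

```python
from collections import Counter


def hit_rates(actual, predicted):
    """Return the overall hit rate and the hit rate within each
    actual group."""
    totals = Counter(actual)
    hits = Counter(a for a, p in zip(actual, predicted) if a == p)
    overall = sum(hits.values()) / len(actual)
    per_group = {g: hits[g] / totals[g] for g in totals}
    return overall, per_group


# Made-up results for eight members.
actual = ["P", "P", "P", "N", "N", "N", "N", "N"]
predicted = ["P", "P", "N", "N", "N", "P", "P", "N"]

overall, per_group = hit_rates(actual, predicted)
print(overall, per_group)
```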


The three group case for the all Navy sample was also
investigated. Here the equality of the group means was
rejected at the 0.01 level of significance. In this case
the length of time remaining in service (EAOS), 32.0%, and
length of service, 12.27%, proved to be the most powerful
determining factors. Both of these variables discriminated
best between the out of service group and the other two
groups. This is probably because a member with one year or
less remaining in his obligation is not likely to take the
exam if he is planning to leave the service, because the
rewards he will reap are not worth the effort to prepare for
the exam. In this sample the rate of hits was 63.0%, which
was slightly better than the two group case.


Clustering analysis of the seamen rates resulted in four
clusters (see Appendix F). The ratings tended to group by
job skills: the electronics rates in one cluster, the
administrative rates in another, etc. Three of the clusters
were subjected to discriminant analysis. The first cluster
investigated was cluster four in Appendix F. The test for
equality of group means generated an F statistic of 1.99
which was significant at the 2.6% level. The largest
discriminating variables in this sample were years of
education (21.77%), race (15.0%), the numerical reasoning
section of the BTB (13.7%), and the clerical aptitude
section of the BTB (10.46%). For this sample 67% of the
observations were correctly classified.


Cluster four was also subjected to the three group
discriminant analysis. Here the F statistic was 6.25 and
was significant at the [illegible] level. As in the all Navy
group, length of time remaining in service was the largest
discriminator (29%), but was followed by the BTB test for
shop aptitude (12.09%) and time in service (8.24%). The
three group sample was correctly classified 59.0% of the
time.


The Seamen rating cluster two, two group sample, was
analyzed next. The test for equality of group means was
significant at the 12% level, indicating that the means of
the groups were close to one another. Since the means were
so close, the discriminating variables and the
classification rates should be viewed with care.


The three group case of the Seamen cluster two did not
test as having equal means and classified 63.04% of the
observations correctly. Even so, only 43% of the non-test
passers were correctly classified.


The Seamen cluster one consisted mostly of electronics
ratings. The two group test of equality of group means
produced an F statistic which was significant at the 45%
level. This is hard to accept until one realizes the entry
requirements for these rates are very specific. Most
entrants have a high school diploma, at least twelve years
of school, have gone to A-school, etc. In short, the
cluster is so homogeneous in terms of the available data
that it was not possible to discriminate between the groups.


The three group case fared somewhat better, as with the
other clusters. The hypothesis of equal means was
rejected and the classification was correct 69.0% of the
time. However, it must be noted that only 38% of the test
passers were correctly classified.


The test of group means for the Boiler Technicians two
group sample resulted in an F statistic which was
significant at 94%. Because of this, the classification
results should be used with care. Even though the three
group analysis showed the means were not equal, the
percentage of correct classifications was only 50.0%. Again
it was this good only because of the addition of the out of
service group. The distinction between the test passers and
non-test passers was not very good, with only 28.0% of the
non-test passers being correctly identified.


The Machinist's Mate rating two group case produced
surprising results when the test of the equality of the
group means produced an F statistic which was significant at
0.36%. Considering the results of the other samples, one
would have expected the significance to be much larger. The
most powerful determining factors for this case were the BTB
tests for general intelligence (29.91%) and mechanical
aptitude (26.44%). Of the total sample, 66% was classified
correctly.


The three group test produced an F statistic which was
significant at 10^-26. This sample showed time remaining in
the Navy to be the most powerful determining factor at
18.05%. Of the total sample, 66% was correctly identified
using the decision rules.


From the above results it can be seen that the means in
most of the two group samples are too close together to
permit any distinct discrimination. This is also true for
the three group samples because, although the out of service
group is distinct enough to cause the equality of group
means test to fail, the test passer and non-test passer
groups remain as indistinct as before. The fact that the
all Navy sample tested as not having equal means, while all
other samples did, appears to be due to the variability of
the data in the all Navy case.


VII. CONCLUSIONS


Discriminant analyses and cluster analyses were used in 
this study as tools to attempt to determine if there was a 
difference between E-4 test passers and non-test passers in
terms of the available biographic data.


The study has shown with a high level of certainty that 
the available data do not differentiate sharply between the 
groups. It would appear that the entrance requirements for
the Navy or the specific rating create a homogeneous group
of personnel in terms of these variables. Since there
appears to be nothing in the general biographic background
of a member to distinguish between the groups, other
possible explanatory variables might be inspected. For
instance, where was a member's last duty station, what type
of leaders did he have, was he training in a rating he
liked?


Although the discriminating variables were not found in 
the study, the use of the E-4 exam as a measure of
effectiveness of a member's utility to the Navy still merits
study. Sharper discrimination might result if data
reflecting actual test takers are used.


APPENDIX A


LIST OF DETERMINING FACTORS


     FACTOR                  TYPE        RANGE

 1.  Dependents              Binary      0 - No dependents
                                         1 - Dependents
 2.  Age                     Interval    19-33
 3.  Race                    Binary      0 - Non-Caucasian
                                         1 - Caucasian
 4.  Sex                     Binary      0 - Female
                                         1 - Male
 5.  A-School                Binary      0 - No A-School
                                         1 - A-School
 6.  Years of education      Interval    9-16
 7.  Education               Binary      0 - No high school diploma
     certification                       1 - High school diploma
 8.  BTB (Gen. Intel.)       Interval    31-74
 9.  BTB (Num. Reason.)      Interval    32-69
10.  BTB (Mech. Apt.)        Interval    [illegible]
11.  BTB (Clr. Apt.)         Interval    39-76
12.  BTB (Shop Apt.)         Interval    37-74
13.  Time in rate            Interval    6-59 mos
14.  Time remaining          Interval    0-48 mos
15.  Length of service       Interval    12-72 mos


APPENDIX B


K GROUP TEST FOR EQUALITY OF GROUP MEANS


\[ H_0 : \mu_1 = \mu_2 = \cdots = \mu_k \]

This implies

\[ w(1) = \mu_1 - \mu_2 = 0 \]

\[ w(2) = \mu_1/2 + \mu_2/2 - \mu_3 = 0 \]

\[ \vdots \]

\[ w(k-1) = \mu_1/(k-1) + \cdots + \mu_{k-1}/(k-1) - \mu_k = 0 \]

Now

\[ d = \sum_{i=1}^{k-1} w(i) = 0 \]



Which is the form of Anderson's test of the hypothesis 

If

    T^2 (N-n-1) [illegible] > F(alpha; [illegible], N-n-1)

then reject the equality of group means.
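
For the two group case, the idea behind this test can be sketched
with the standard two-sample Hotelling T^2 statistic and its
conversion to an F statistic. This is a minimal sketch under stated
assumptions, not the computation used in the thesis: it covers only
two groups (not the K group contrast form above), it assumes a
common covariance matrix that is not singular, and the function
names and the sample values are hypothetical. With N members in
total and n variables, the F statistic has n and N-n-1 degrees of
freedom.

```python
def mean_vector(rows):
    """Column means of a list of equal-length observation rows."""
    count = len(rows)
    return [sum(col) / count for col in zip(*rows)]


def scatter_matrix(rows, mean):
    """Sum of outer products of deviations from the mean."""
    p = len(mean)
    s = [[0.0] * p for _ in range(p)]
    for row in rows:
        dev = [x - m for x, m in zip(row, mean)]
        for i in range(p):
            for j in range(p):
                s[i][j] += dev[i] * dev[j]
    return s


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("pooled covariance matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def hotelling_two_sample(g1, g2):
    """Return (T^2, F statistic, (df1, df2)) for two groups of rows."""
    n1, n2, p = len(g1), len(g2), len(g1[0])
    m1, m2 = mean_vector(g1), mean_vector(g2)
    s1, s2 = scatter_matrix(g1, m1), scatter_matrix(g2, m2)
    pooled = [[(s1[i][j] + s2[i][j]) / (n1 + n2 - 2) for j in range(p)]
              for i in range(p)]
    d = [a - b for a, b in zip(m1, m2)]
    w = solve(pooled, d)                      # pooled^-1 d
    t2 = (n1 * n2 / (n1 + n2)) * sum(di * wi for di, wi in zip(d, w))
    df1, df2 = p, n1 + n2 - p - 1
    f_stat = t2 * df2 / ((n1 + n2 - 2) * p)
    return t2, f_stat, (df1, df2)


# Hypothetical two-variable samples (assumption, not thesis data).
group_a = [[0, 0], [2, 0], [0, 2], [2, 2]]
group_b = [[1, 0], [3, 0], [1, 2], [3, 2]]
t2, f_stat, dof = hotelling_two_sample(group_a, group_b)
print(t2, f_stat, dof)
```

The F statistic is then compared with the tabled F value at the
chosen level; equality of the group means is rejected when the
statistic exceeds it.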


APPENDIX C


K GROUP CLASSIFICATION FUNCTION


Assign to group g if for all other groups h:


E(x) JE we) >aTt/ boa ee: caves gh eps = 
g A Fes. 1g 


Assign to group g if for all other groups h:


-1 ~1 ~4 ~1 
2 ieee ol = Vier ba 
Bb g g 


~1 
eee erie re eee gy 
= o g h og 
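
A rule of this kind is commonly written as a linear score per
group, with the member assigned to the group whose score is
largest. The sketch below uses that standard form, which assumes
normally distributed variables with a common covariance matrix; it
is an illustration and is not claimed to match the appendix term
for term. The score used is x'S^-1 m_g - (1/2) m_g'S^-1 m_g +
ln(p_g), where m_g is the group mean, S^-1 the inverse pooled
covariance matrix and p_g the prior probability of group g. The
group means, the identity matrix used for S^-1, the equal priors
and the function names are all hypothetical.

```python
import math


def linear_scores(x, means, cov_inv, priors):
    """Linear classification score of observation x for every group."""
    scores = {}
    for group, mu in means.items():
        p = len(mu)
        # w = S^-1 m_g
        w = [sum(cov_inv[i][j] * mu[j] for j in range(p)) for i in range(p)]
        scores[group] = (
            sum(xi * wi for xi, wi in zip(x, w))
            - 0.5 * sum(mi * wi for mi, wi in zip(mu, w))
            + math.log(priors[group])
        )
    return scores


def classify(x, means, cov_inv, priors):
    """Assign x to the group g whose score exceeds that of every other group h."""
    scores = linear_scores(x, means, cov_inv, priors)
    return max(scores, key=scores.get)


# Hypothetical standardized group means on two variables (assumption).
means = {
    "passer": [2.0, 0.0],
    "non_passer": [0.0, 0.0],
    "out_of_service": [0.0, 3.0],
}
cov_inv = [[1.0, 0.0], [0.0, 1.0]]            # identity, for illustration only
priors = {group: 1.0 / 3.0 for group in means}

print(classify([1.8, 0.2], means, cov_inv, priors))
print(classify([0.2, 0.1], means, cov_inv, priors))
print(classify([0.1, 2.5], means, cov_inv, priors))
```

When two group means lie close together, as the test passer and
non-test passer means do in most samples, their scores differ
little for most members and the rule separates them poorly.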


APPENDIX D


TESTS OF EQUALITY OF GROUP MEANS


                         EQUALITY OF VARIANCE        EQUALITY OF MEANS
                         F STATISTIC  LEVEL %        F STATISTIC  LEVEL %

All Navy    (2 Group)    1.73         [illegible]    3.83         [illegible]
All Navy    (3 Group)    2.34         0.0            16.69        0.0
SN Clstr 1  (2 Group)    [illegible]  0.0            [illegible]  [illegible]
SN Clstr 1  (3 Group)    [illegible]  0.0            8.15         [illegible]
SN Clstr 2  (2 Group)    [illegible]  [illegible]    [illegible]  [illegible]
SN Clstr 2  (3 Group)    [illegible]  0.0            3.14         [illegible]
SN Clstr 4  (2 Group)    1.98         0.0            1.99         [illegible]
SN Clstr 4  (3 Group)    2.05         0.0            [illegible]  0.0
BT          (2 Group)    [illegible]  [illegible]    0.747        94.70
BT          (3 Group)    [illegible]  0.0            [illegible]  [illegible]
MM          (2 Group)    1.90         0.0            [illegible]  [illegible]
MM          (3 Group)    2.10         0.0            [illegible]  [illegible]


APPENDIX E


POWER OF DETERMINING FACTORS


ALL NAVY

2 GROUP                              3 GROUP

Race                 24.50%          Time remaining       32.03%
Edu. cert.           20.30%          Time in service      12.20%
Dependents           13.59%          Dependents           7.84%
Age                  [illegible]     Sex                  7.45%
BTB (Clr. Apt.)      8.47%           Edu. cert.           6.99%
BTB (Num. Reason.)   5.58%           Time in rate         6.40%
BTB (Mech. Apt.)     5.28%           BTB (Shop Apt.)      6.04%
A-School             4.59%           BTB (Mech. Apt.)     [illegible]
BTB (Gen. Intel.)    4.59%           A-School             3.98%
Years of edu.        [illegible]     BTB (Clr. Apt.)      3.04%
BTB (Shop Apt.)      0.48%           Race                 2.43%
Sex                  [illegible]     Age                  2.05%
                                     Years of edu.        [illegible]
                                     BTB (Gen. Intel.)    1.49%
                                     BTB (Num. Reason.)   1.35%


SEAMEN GROUP 1

2 GROUP                              3 GROUP

BTB (Shop Apt.)      29.43%          Time remaining       35.72%
BTB (Clr. Apt.)      17.88%          BTB (Gen. Intel.)    10.95%
BTB (Num. Reason.)   14.32%          BTB (Shop Apt.)      8.82%
BTB (Gen. Intel.)    8.55%           Age                  8.19%
BTB (Mech. Apt.)     6.58%           Time in rate         6.21%
Dependents           6.07%           BTB (Clr. Apt.)      [illegible]
A-School             5.14%           BTB (Mech. Apt.)     4.89%
Age                  4.23%           Time in service      3.54%
Race                 [illegible]     A-School             3.46%
Edu. cert.           3.14%           Dependents           3.45%
Yrs of edu.          1.33%           Edu. cert.           3.38%
                                     BTB (Num. Reason.)   [illegible]
                                     Yrs of edu.          1.93%
                                     Race                 [illegible]


SEAMEN GROUP 2

2 GROUP                              3 GROUP

A-School             17.30%          Time remaining       37.63%
BTB (Mech. Apt.)     14.98%          A-School             6.76%
BTB (Clr. Apt.)      13.35%          Time in rate         6.69%
BTB (Shop Apt.)      [illegible]     Dependents           6.24%
Yrs of edu.          9.70%           BTB (Num. Reason.)   [illegible]
Edu. cert.           9.17%           Edu. cert.           [illegible]
Sex                  7.14%           Yrs of edu.          5.42%
BTB (Gen. Intel.)    4.85%           Time in service      [illegible]
BTB (Num. Reason.)   4.73%           Race                 4.52%
Age                  [illegible]     BTB (Shop Apt.)      3.99%
Dependents           2.40%           BTB (Mech. Apt.)     [illegible]
Race                 1.80%           BTB (Clr. Apt.)      3.49%
                                     BTB (Gen. Intel.)    [illegible]
                                     Sex                  1.39%
                                     Age                  1.05%


SEAMEN GROUP 4

2 GROUP                              3 GROUP

Years of edu.        21.77%          Time remaining       29.27%
Race                 15.04%          BTB (Shop Apt.)      12.09%
BTB (Num. Reason.)   13.70%          Time in service      8.24%
BTB (Clr. Apt.)      10.46%          Dependents           7.94%
BTB (Shop Apt.)      10.31%          Edu. cert.           7.14%
Sex                  [illegible]     BTB (Gen. Intel.)    [illegible]
Edu. cert.           6.01%           A-School             4.39%
Dependents           4.52%           Years of edu.        4.15%
BTB (Gen. Intel.)    3.31%           BTB (Num. Reason.)   4.06%
A-School             [illegible]     Sex                  3.97%
Age                  2.89%           Race                 3.94%
BTB (Mech. Apt.)     2.39%           Time in rate         2.58%
                                     BTB (Mech. Apt.)     1.96%
                                     BTB (Clr. Apt.)      1.79%
                                     Age                  1.74%


BOILER TECHNICIAN

2 GROUP

BTB (Gen. Intel.)
BTB (Num. Reason.)
BTB (Clr. Apt.)
Years of edu.
Edu. cert.
A-School
BTB (Shop Apt.)
Dependents
BTB (Mech. Apt.)

18.11%
16.96%
15.62%
12.32%
10.30%
9.43%
6.99%
4.39%
4.05%
1.42%
0.35%

3 GROUP

Time remaining       18.05%
Years of edu.        15.23%
BTB (Shop Apt.)      12.84%
Time in service      10.73%
Age                  10.40%
BTB (Mech. Apt.)     10.25%
Edu. cert.           7.40%
A-School             3.23%
BTB (Gen. Intel.)    3.13%
Time in rate         2.92%
BTB (Num. Reason.)   2.19%
Dependents           [illegible]
Race                 1.30%
BTB (Clr. Apt.)      0.53%


MACHINIST'S MATE

2 GROUP                              3 GROUP

BTB (Gen. Intel.)    29.91%          Time remaining       29.56%
BTB (Mech. Apt.)     26.44%          BTB (Mech. Apt.)     [illegible]
Age                  10.01%          Edu. cert.           11.17%
BTB (Num. Reason.)   9.68%           Time in service      8.30%
BTB (Clr. Apt.)      7.97%           BTB (Gen. Intel.)    6.91%
Dependents           [illegible]     Time in rate         6.26%
Years of edu.        [illegible]     BTB (Clr. Apt.)      5.29%
A-School             2.41%           Age                  4.65%
Race                 1.70%           Years of edu.        3.88%
Edu. cert.           1.53%           Race                 3.05%
BTB (Shop Apt.)      0.12%           BTB (Shop Apt.)      [illegible]
                                     BTB (Num. Reason.)   [illegible]
                                     A-School             [illegible]
                                     Dependents           0.23%


APPENDIX F


RESULTS OF CLUSTERING OF SEAMEN RATINGS


CLUSTER 1                            CLUSTER 2

STS - Sonar Tech (S)                 SK  - Storekeeper
STG - Sonar Tech (G)                 PC  - Postal Clerk
FTG - Fire Cntl Tech (gun)           YN  - Yeoman
FTM - Fire Cntl Tech (missile)       DK  - Disbursing Clerk
FTB - Fire Cntl Tech (FBM)           MS  - Mess Specialist
ETN - Electronics Tech (comm)        SH  - Ship's Serviceman
ETR - Electronics Tech (radar)       LI  - Lithographer
EW  - Elec Warfare Tech              CTA - Comm Tech (admin)
CTM - Comm Tech (maint.)             TM  - Torpedoman's Mate
CTI - Comm Tech (interp.)            BM  - Boatswain's Mate
DS  - Data Systems Tech
MT  - Missile Tech


CLUSTER 3                            CLUSTER 4

GMG - Gunner's Mate (gun)            CTT - Comm Tech (tech)
GMM - Gunner's Mate (missiles)       CTO - Comm Tech (comm)
GMT - Gunner's Mate (tech)           CTR - Comm Tech (collection)
MN  - Mineman                        DP  - Data Processing Tech
OM  - Opticalman                     IS  - Intelligence Spec
MA  - Master at Arms                 PN  - Personnelman
IM  - Instrumentman                  RM  - Radioman
DM  - Draftsman
QM  - Quartermaster
OS  - Operations Spec
OT  - Ocean Sys Spec
MU  - Musician
JO  - Journalist
LN  - Legalman


APPENDIX G


CLASSIFICATION RESULTS


SAMPLE                   % CORRECTLY CLASSIFIED

                                  TEST         NON-TEST   OUT OF
                         TOTAL    PASSERS      PASSERS    SERVICE

All Navy    (2 Group)    62%      30%          45%        --
All Navy    (3 Group)    63%      [illegible]  39%        71%
SN Clstr 1  (2 Group)    74%      32%          88%        --
SN Clstr 1  (3 Group)    69%      33%          79%        72%
SN Clstr 2  (2 Group)    63%      77%          49%        --
SN Clstr 2  (3 Group)    63%      71%          43%        76%
SN Clstr 4  (2 Group)    67%      31%          50%        --
SN Clstr 4  (3 Group)    69%      80%          42%        77%
BT          (2 Group)    60%      86%          34%        --
BT          (3 Group)    57%      80%          28%        [illegible]
MM          (2 Group)    66%      71%          60%        --
MM          (3 Group)    66%      52%          58%        [illegible]